This portfolio explores the albums Vulfpeck, their members and collaborators. Vulfpeck’s Joe Dart is praised as one of the best bassists to emerge in recent years, but that’s not the only reason for this choice. Firstly, Vulfpeck is known for their one-take style of recording and off-beat musical style. Secondly, the band has a few closely associated members that appear on a large section of their music. Finally, the band’s members have their own solo projects which often still feature each other. This allows us to compare the albums of these artists and see what makes their music unique. This is the main goal of the analysis in this corpus.
o
The natural groups in the corpus are as follows. Vulfpeck, Theo Katzmann, Woody Goss, Cory Wong, The Fearless Flyers and Nate Smith. Each of these categories consists of at least 3 albums and almost all albums feature the other artists. Though they share a band there are some clear differences in genre, Vulfpeck is primarily funk, Theo Katzman is more slow love songs, Cory Wong has the danceability of pop music, Woody Goss is minimalistic and has a more straight feel, The Fearless Flyers is extremely high energy and fast paced and Nate Smith’s solo work is odd-timed and syncopated heavy drumming.
o
It will be interesting to find out what makes each artists music truly theirs, even though they are so closely related and collaborative.
o
Below you will find a short description of each artists and a typical song of theirs.
v
Here we see the typical songs for each artist as mentioned earlier depicted by four track-level features. Some of the things I noticed myself are also representedin the graph.
The Fearless Flyers have the highest energy by far, while the love songs of Theo Katzman and the minimalist funk of Vulfpeck get low scores.
For Theo Katzmans song the valence is also the lowest, which often corresponds to sad songs, much like the melancholy often felt when listening to love songs.
The Dancability feature then seems odd in combination with the others. I would have expected Cory Wong to be the highest by far in this category.
Cory Wong scores high in Valence however, perhaps it was my personal correlation between dancing and happiness, which is often said to be represented by a combination of valence and energy in music.
Finally, Nate Smith scores lowest in Liveness. The track used is a solo drum track, which might explain the low score due to a lack of noise in the recording.
We are certainly gaining some insight into the differences between the artists, but these are just some typical tracks. Lets look at the bigger picture!
Here we see the entire dataset plotted based on valence on the x-axis and energy on the y-axis.
We can already begin to see some larger patterns. Most of the music is happy, and you can see that Fearless Flyers music actually never leaves the happy quadrant. Cory Wongs music also rarely does, but sometimes has a lower valence and ends up in theh angry quadrant.
You can clearly see Theo Katzman, Vulfpeck and Woody Goss are on the lower side of the graph. Overall they have less energy. For Theo Katzmann this is because of the style of music he usually plays, which is half-time smooth love songs. Woody Goss’ and Vulfpeck’s reason is most likely the minimalist production they use. Short notes and interesting rests in the music are something that makes their music uniqie.
Nate Smith is the only one who shows a clear pattern of low valence and high energy. This somewhat understandable, a lot of his music features heavy drum parts which in audio analysis often leads to high and intense peaks. It is difficult to descibe how one would express valence in terms of drums.
Tempo and Loundess seem to have little correlation. We can however see that for most artists songs that are high in both tempo and loudness have a higher chance of being in a major key.
Although the distribution is clearly not random, the modes used don’t seem to correspond with the, perhaps too simple, assumption that we can explain the emotion of the music using just the mode.
We can clearly see Theo Katzman uses almost exclusively Major keys, but still his songs were relatively low energy and valence compared to the other groups, which feature a lot more Minor keys.
Cory Wong, whose music I find the most uplifting of all, has relatively many minor keys.
One thing that is striking is the identical gap in keys between Vulfpeck and the Fearless Flyers for the keys of D# and E. Both these bands feature Joe Dart, perhaps he doesn’t like those keys very much?
Funnily enough, in his solo repetoire Cory Wong also doesn’t prefer d#, but does have a few songs in E.
We see the mode tempo for this dataset is 106 BPM. Compared to popular music, whose average tempo has risen up to 120 BPM in recent years, this is on the slower side.
There are also smaller peaks around 70, 90, 130, 155 and 170 BPM.
We can see that the graph is slightly left skewed which indicates that lower tempos are more constant, while higher tempos are more variable. .
.
.
.
.
.
We can start to see Tempo might be a pretty good predictor in certain cases. Nate Smiths Music has a clear peak around 90 BPM. The Fearless Flyers also havea high tempo as their mode, around 130 BPM, which corresponds to my expectation mentioned earlier. This also explains the peak in the earlier graph.
Although there seem to be clear areasfor each plot, we can see the means are very close together. This is likely caused due to the outliers you can see in almost every group except for the Fearless Flyers. .
.
.
.
.
0-20 seconds
The song opens with a bass melody that continous throughout the song.
100-110 seconds
A vocal solo
140-140 seconds
Vocal fills
In the timbre-based self similarity matrix on the left we see a bright glow indicating that, overall, the timbre of the song is not very constant This is quite a surprise, the song features a bass and keyboard that can be heard throughout the song. Perhaps the combination of instruments and vocals create sounds that the analysis simply couldn’t keep apart
The pitch-based self similarity matrix tells the opposite story. It is dark throughout, indicating a steady pitch, except for two key moments, which are caused by the vocal breaks. The vocals on this track are highly improvised which explains the clear difference with the repetitive melodies of the other instruments.
It’s interesting to see that the vocals have such a high influence on the overall structure of the song
.
.
.
.
.
.
.
.
.
.
.
.
.
.
0-40 seconds Main theme
Drum fill
50-70 seconds Main theme again
Drum fill
80-110 seconds Extended build up
120-140 seconds Drum solo
The song here is clearly divided in 4 parts. The first two parts are very similar, which is also what you hear. The playing style is the same and this is reflected in the dark squares. Each part is divided by a drum fill which differs from the rest of the song. It features three string instruments at the same time. The final part of the song is a drum solo, it has little similarity with the rest of the track.
As for the melody, the same pattern is visible, but in a much more exagarated manner. The largest bright lines occur when there string section goes from fast strumming to a melodic build up with longer notes. The drums seem to have a high influence in the pitch similarity still as even the final part has a high similarity with the rest, even though it features little strings. .
.
.
.
.
.
.
.
.
.
.
.
.
.
0-120 seconds Hi Hat grooves
120-150 seconds Ride Grooves
150-190 seconds Back to Hi Hat
Here we can really see the power of timbre analysis. Drums are a limited instrument in melodic terms, but the difference between different drums is something everyone can hear and in this case an amazing pattern occurs. The small lines are caused by hits on the crash cymbal, which is often used on the first count of a 4 or 8 bar groove. The clear dark squares are caused by the different cymbals used for the eigth notes played in the groove. The distinction between the smaller HiHat and the larger Ride creates a difference in pitch that is clearly picked up by the chromatic analysis.
As you might expect, the pitches don’t really say that much. He simply plays the drums for a solid three and a half minutes, so the only variation is between the different drums, but no clear pattern occurs there. Again the biggest brightness is due to the lack of cymbals and can be seen in the opening 10 seconds.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
0-60 seconds Verse and Chorus
60-120 seconds Verse and Chorus
120-180 seconds Verse and Chorus
180-240 seconds Verse and Chorus
The timbre analysis does not show a very clear structure. The third repetition features a more pronounced brass section, while the guitar takes a little break.
The most striking pattern is in the pitch features. The Bright lines are caused by breaks at the end of the chorus, going back into the verse. Here you can see this pattern of verse and chorus is repeated 4 times with similar sounds.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
0-200 seconds Soothing and rythmic guitar progression with a repeating vocal theme
The timbre feature seems to show very little similarity throughout the song, however, this is likely caused by the fact that the song is actually extremely similar.
The pitch analysis confirms the repetition I claimed above. The guitar progrression is a 4 bar loop, and the vocal melody on top of it starts each repetition with the same theme: “Love is a beautiful thing”. This exact repetition causes the diagonal lines in the graph
.
.
.
.
.
.
.
.
.
.
.
.
.
.
0-70 verse-prechorus-chorus
70-140 verse-prechorus-chorus
140-250 seconds A sing-along type vocal melody that is repeated
The timbre features again give little insight. The final part features a trumpet, which doesn’t occur throughout the rest of the song.
The strange checkerboard pattern that can be seen twice in is caused by the way the melody is built. It is an interplay between a bass on the down-beat and a high guitar chord on the up-beat. The final part is a repetitive vocal melody made to sing along.
To get the best classifier I tried both KNN and Random Forest to see which could achieve the best performance. Although the KNN was surprisingly effective, the random forest classifier did achieve the best
# A tibble: 6 x 3
class precision recall
<fct> <dbl> <dbl>
1 Cory Wong 0.561 0.667
2 Fearless 0.750 0.545
3 Katzman 0.607 0.548
4 Nate 0.722 0.448
5 Vulfpeck 0.5 0.622
6 Woody 0.412 0.292
When constructing the network one thing that drastically improved performance is using only the top 10 most influential features. Still there are some interesting observations to be made. Although this confusion matrix gives some insight, it is skewed towards the bigger categories like Vulfpeck and Cory wong, and it only shows numbers, and not a clear measure of performance. An analysis of the top features can be seen on the following page.
We can see that at least every category is the most predicted for each truth value. The Precision is especially high for Nate Smith and The Fearless Flyers. This likely because Nate Smith’s music is so drum heavy, and the Fearles Flyers music is guitar heavy. However, this precision comes at a price for Nate Smith, because the recall is low.
Woody Goss is the hardest predict, but there is a reasonable explanation for this. One of his albums is compilation of christmas covers of existing Vulfpeck songs. This means that these might not differ that much in pitch class, and since the Vulfpeck category is three times as large as Woody Goss’ category, it’s more efficient for the network to predict Vulfpeck.
.
.
.
.
.
.
This shows the most important distinguishing features between the different playlists. Loudness seems to be the key distinguishing feature, followed by instrumentalness and energy. The rest of the top 10 is mainly timbre components, while the key of the song doesn’t seem to contribute all that much to the predictions. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Here we try clustering in the 6 categories we defined at the beginning of this portfolio. As you can see the groups are certainly not pure. Nate smith seems to concentrate in the upper right, but is also an outlier in the blue cluster. The overlap between the middle clusters is also very large, which doesn’t really signify great discriminative ability.
[1] 228
If we look more closely at the clusters we can see that the clusters are quite good at distinguishing certain things.
If we look at cluster one we see a combination of Vulfpeck, official members Theo Katzman and Cory Wong, and guest artists Cory Wong and Nate Smith. It seems that this group captures the essence of average Vulfpeck Music.
Cluster two is a combination of Vulfpeck, Cory Wong and Fearless Flyers. Bassist Joe Dart and Guitarist Cory Wong both play in these bands. Perhaps this cluster captured that feature.
The Fearless flyers and Nate Smith are clearly the biggeset groups in cluster three. Nate Smith is the drummer on both of these and has a distinctive fusion funk style.
Cluster four is harder to pinpoint. It may just be based largely on Cory Wong’s solo repertoire, and have difficulty distinguishing between that and Vulfpeck.
Cluster five is also not very descriptive. The function used recommended using two clusters, clearly the groups defined by main artist is not the most distinctive.
Cluster six looks a lot like cluster 1, but the fact that Nate Smith is not included is telling. He joined later than the other members, and the relatively large proportions for Theo and Woody show clearly their style is closer to Vulfpeck’s combined style, which is to be expected as founding members.
In the previous slide we were shown the 6 clusters our random forest came up with. This is represented in our graph along two dimensions. In machine learning Principal Component Analysis is used to determine a linear combination of the features in our dataset that offer the highest rate of variability. In this circle we see the same two dimensions, but now plotted on top is the contribution of each of our features, the ones we know how to interpret. We can see that acousticness and energy are the biggest contributours for dimension 1, while c01 plays the largest role in dimension 2. It is through this combination of features that you can really start to distinguish groups. .
.
.
To say that this was the limit of what is possible in musical analysis would be plainly false. In this short portfolio I have tried to show through many ways and interpretations what it is that makes Vulfpeck and the surrounding artists special.
o
Overall, we’ve confirmed the stereotype that Vulfpeck is minimalist, a bit slower, a bit lower energy than the rest of them. Also Theo Katzman’s title as sappy love song writer can only be confirmed when compared to the others. Cory Wong’s enthusiastic music turned out to be using the most minor chords, this is likely because of the harmonious string and brass sections he features aswell.
o
One thing that surprised me was how much Spotifys track-level featured contributed towards the clustering and predictive algorithms. We built a random forest model that performed consistently, going down in accuracy mainly for underrepresented groups.
o
To anyone reading this, I Highly recommend checking out all of the artists mentioned in this portfolio. There is an amazing live show in Madison Square Garden that singlehandedly helped me through some tough lockdown moments, it’s good music and good laughs.
o
Thanks for reading!
v